Spectral subtraction in noisy environments applied to speaker adaptation based on HMM sufficient statistics

Authors

Shingo Yamade

Kanako Matsunami

Akira Baba

Akinobu Lee

Hiroshi Saruwatari

Kiyohiro Shikano

Abstract

Noise and speaker adaptation techniques are essential for robust speech recognition in real noisy environments. In this paper, we apply spectral subtraction to an unsupervised speaker adaptation algorithm in noisy environments. The adaptation algorithm consists of the following five steps. (1) Spectral subtraction is carried out on the noise-added database. (2) Noise-matched acoustic models are trained using the noise-added speech database. (3) HMM sufficient statistics for each speaker are calculated from the noise-added speech database and stored. (4) Based on one arbitrary utterance, speakers close to the test speaker are selected using speaker GMMs. (5) Speaker-adapted acoustic models are constructed from the HMM sufficient statistics of the selected speakers. We evaluated our unsupervised speaker adaptation algorithm in noisy environments on a 20k dictation task. The recognition experiments show that our speaker-adapted acoustic model achieves 82% word accuracy at 20 dB SNR, which is about 6% higher than that of the noise-matched models trained by the Forward-Backward algorithm. We also investigated the robustness of the adapted models under various SNR conditions. Integration with supervised MLLR is also examined.
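Steps (1) and (5) of the abstract can be illustrated with a minimal sketch. The abstract does not say which spectral subtraction variant is used or how the statistics are stored, so everything below is an assumption for illustration: the over-subtraction factor `alpha`, the spectral floor `beta`, the function names, and the reduction of each HMM Gaussian to a one-dimensional (occupancy, first-order sum, second-order sum) triple. Speaker selection with GMMs (step 4) is taken as already done.

```python
from typing import List, Sequence, Tuple


def spectral_subtract(
    power_frames: Sequence[Sequence[float]],
    noise_power: Sequence[float],
    alpha: float = 2.0,   # hypothetical over-subtraction factor
    beta: float = 0.01,   # hypothetical spectral floor (fraction of noisy power)
) -> List[List[float]]:
    """Power-domain spectral subtraction with flooring.

    Each frame is a list of power-spectrum bins; noise_power is an
    estimate of the noise power per bin (e.g. averaged over non-speech frames).
    """
    cleaned = []
    for frame in power_frames:
        # Subtract the scaled noise estimate, but never go below the floor.
        cleaned.append(
            [max(p - alpha * n, beta * p) for p, n in zip(frame, noise_power)]
        )
    return cleaned


# Sufficient statistics of one 1-D Gaussian for one speaker:
# (occupancy count, sum of observations, sum of squared observations).
Stats = Tuple[float, float, float]


def combine_stats(selected: Sequence[Stats]) -> Tuple[float, float]:
    """Build an adapted mean and variance from the selected speakers' statistics.

    No retraining pass over the speech data is needed: the stored
    accumulators are summed and normalized by the total occupancy.
    """
    occupancy = sum(s[0] for s in selected)
    first = sum(s[1] for s in selected)
    second = sum(s[2] for s in selected)
    mean = first / occupancy
    variance = second / occupancy - mean * mean
    return mean, variance


if __name__ == "__main__":
    frames = [[10.0, 1.0]]
    noise = [2.0, 1.0]
    print(spectral_subtract(frames, noise, alpha=2.0, beta=0.1))

    # Two selected speakers whose frames were (1, 3) and (5, 7).
    print(combine_stats([(2.0, 4.0, 10.0), (2.0, 12.0, 74.0)]))
```

Because only the stored accumulators are summed, building the adapted model in this sketch costs a few additions per Gaussian, which is consistent with the abstract's use of a single arbitrary utterance for adaptation.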


Similar resources

Rapid unsupervised speaker adaptation based on multi-template HMM sufficient statistics in noisy environments

This paper describes a multi-template unsupervised speaker adaptation based on HMM sufficient statistics. Multiple class-dependent models based on gender and age are used to improve adaptation performance while keeping adaptation time within a few seconds with just one arbitrary utterance. Adaptation begins with the estimation of the speaker's class from the N-best neighbor speakers using Gaussia...


Improved HMM Separation for Distant-Talking Speech Recognition

In distant-talking speech recognition, the recognition accuracy is seriously degraded by reverberation and environmental noise. A robust speech recognition technique in such environments, HMM separation and composition, has been described in [1]. HMM separation estimates the model parameters of the acoustic transfer function using adaptation data uttered from an unknown position in noisy and re...


Speech recognition in noisy environments using first-order vector Taylor series

In this paper, we generalize relations between clean and noisy speech signal using vector Taylor series (VTS) expansion for noise-robust speech recognition. We use it for both the noisy data compensation and hidden Markov model (HMM) parameter adaptation, and apply it for the cepstral domain directly, while Moreno used it to estimate the log-spectral parameters. Also, we develop a detailed ...


Speaker adaptation in noisy environments based on parameter estimation using uncertain data

This paper describes a new method for the speaker adaptation of HMM parameters in environments with background noise. This method is based on Bayesian estimation, and calculates the a posteriori distribution of clean-speech HMM parameters from their a priori distribution by using noisy speech observations. The advantage of the method is that the distribution of the noise can be taken into account ...


Text-Independent Speaker Verification for Real Fast-Varying Noisy Environments

Investigating Speaker Verification in real-world noisy environments, a novel feature extraction process suitable for suppression of time-varying noise is compared with a fine-tuned spectral subtraction method. The proposed feature extraction process is based on approximating the clean speech and the noise spectral magnitude with a mixture of Gaussian probability density functions (pdfs) by usin...




Publication date: 2002
